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Abstract 

The question how an agent is affected by its embodiment 
has attracted growing attention in recent years. A new 
field of artificial intelligence has emerged, which is based 
on the idea that intelligence cannot be understood with¬ 
out taking into account embodiment. We believe that 
a formal approach to quantifying the embodiment’s ef¬ 
fect on the agent’s behaviour is beneficial to the fields of 
artificial life and artificial intelligence. The contribution 
of an agent’s body and environment to its behaviour is 
also known as morphological computation. Therefore, in 
this work, we propose a quantification of morphological 
computation, which is based on an information decompo¬ 
sition of the sensorimotor loop into shared, unique and 
synergistic information. In numerical simulation based 
on a formal representation of the sensorimotor loop, we 
show that the unique information of the body and envi¬ 
ronment is a good measure for morphological computa¬ 
tion. The results are compared to our previously derived 
quantification of morphological computation. 


1 Introduction 


Morphological computation is discussed in various con¬ 
texts, such as DNA computing and self-assembly (see 


Pfeifer et al. 

2007b| 

Hauser et al. 

2012 


In this publication, we are interested in quantifying mor¬ 
phological computation of embodied agents which are 
embedded in the sensorimotor loop. Morphological com¬ 
putation, in this context, is described as the trade-off 


between morphology and control (Pfeifer and Scheier 


19991, which means that a well-chosen morphology, if 


exploited, substantially reduces the amount of required 
control (Montufar et al. 2014). Hereby, a morphology 


refers to the agent’s body, explicitly including all its 


physiological and physical properties (shape, sensors, ac¬ 
tuators, friction, mass distribution, etc.) ( Pfeiferf 2002). 
The consensus is that morphological computation is the 
contribution of the morphology and environment to the 
behaviour, that cannot be assigned to a nervous system 
or a controller. There are several examples from biol¬ 
ogy, which demonstrate how the behaviour of an agent 
relies on the interaction of the body and environment. 
A nice example is given by Wootton (1992 see p. 188), 
who describes how “active muscular forces cannot en¬ 
tirely control the wing shape in flight. They can only 
interact dynamically with the aerodynamic and inertial 
forces that the wings experience and with the wing’s own 
elasticity; the instantaneous results of these interactions 
are essentially determined by the architecture of the wing 
itself [...]” 

One of the most cited example from the field of embod¬ 
ied artificial intelligence is the Passive Dynamic Walker 
by McGeer (1990). In this example, a two-legged walk¬ 
ing machine preforms a naturally appealing walking be¬ 
haviour, as a result of a well-chosen morphology and en¬ 
vironment, without any need of control. There is simply 
no computation available and the walking behaviour is 
the result of the gravity, the slope of the ground and 
the specifics of the mechanical construction (weight and 
length of the body parts, deviation of the joints, etc.). 
If any parameter of the mechanics (morphology) or the 
slope (environment) is changed, the walking behaviour 
will not persist. In this context, we understand the ex¬ 
ploitation of the body’s and environment’s physical prop¬ 
erties as the embodiments effect on a behaviour. 

Theoretical work on describing morphological com¬ 
putation in the context of embodied artificial intel¬ 
ligence has been conducted by (jHauser et al. 2011 


Fiichslin et al., 2012). In this publication, we study an 


information-theoretic approach to quantifying morpho¬ 
logical computation which is based on an information 
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decomposition of the sensorimotor loop. This work is 
based on two of our previous publications in which we 
have investigated different quantifications of morpholog¬ 


the context of reinforcement learning (Sutton and Barto 


ical computation (Zahedi and Ay 2013) and derived a 


of Bertschinger et al. (2014) to the setting of Zahedi and 


prefer the notion of the system’s Umwelt (von Uexkuell 


1934 Clark 1996 Zahedi et al. 2010), which is the part 


(see e.g. Pfeifer et al., 2007a) we introduce the notion of 


1998), in which the environment is understood as every¬ 


general decomposition of a mutual information of three 
random variables into unique, shared, and synergistic in¬ 
formation ( Bertschinger et akf 2014). In our previous 
work (Zahedi and Ay 2013), we derived two concepts 


which both match the general intuition about morpho¬ 
logical computation, but showed different results. In this 
publication, we will apply the information decomposition 


Ay (2013) with the goal to unify the two previously de¬ 
rived concepts. 

The paper is organised in the following way. The next 
section discusses the sensorimotor loop and its represen¬ 
tation as a causal graph. The third section describes the 
bivariate information decomposition from |Bertschinger| 
et al. (2014). Based on the information decomposition. 


thing that cannot be controlled arbitrarily by the agent. 

The distinction between intrinsic and extrinsic is also 
captured in the representation of the sensorimotor loop 
as a causal or Bayesian graph (see Fig. [^. For simplic¬ 
ity, we only discuss the sensorimotor loop for reactive sys¬ 
tems. This is plausible, because behaviours which exploit 
the embodiment are usually better described as reactive 
and not as deliberative. The most prominent examples 
are locomotion behaviours, e.g. human walking, swim¬ 
ming, flying, etc., which are all well-modelled as reactive 
behaviours. 

The random variables S, A, and W refer to sensor, 
actuator, and world state, and the directed edges reflect 
causal dependencies between the random variables (see 
Klyubin et al.| [2004} |Ay and Polanil |2008[ [Zahedi et aL| 


2010). Everything that is extrinsic is captured in the 


the fourth section introduces the unique information as 
a measure for morphological computation in the sensori¬ 
motor loop. The fifth section presents numerical results, 
which are then discussed in the final section. An ap¬ 
pendix explains how we computed our measure of mor¬ 
phological computation. 

2 Sensorimotor Loop 

Our information theoretic decomposition of the mutual 
information requires a formal representation of the senso¬ 
rimotor loop, which we will introduce in this section. In 
our understanding, a cognitive system consists of a brain 
or controller, which sends signals to the system’s actu¬ 
ators, thereby affecting the system’s environment. We 


of the system’s environment that can be affected by the 
system, and which itself affects the system. The state of 
the actuators and the Umwelt are not directly accessible 
to the cognitive system, but the loop is closed as informa¬ 
tion about the Umwelt and the body is provided to the 
controller through the sensors. In addition to this gen¬ 
eral concept of the sensorimotor loop, which is widely 
used in the embodied artificial intelligence community 


world and by that we mean the system’s morphology and 
the system’s Umwelt. We can now distinguish between 
the intrinsic and extrinsic perspective in this context. 
The world is everything that is extrinsic from the per¬ 
spective of the cognitive system, whereas the controller, 
sensor and actuator signals are intrinsic to the system. 
This is analogous to the agent-environment distinction in 


variable W, whereas S and A are intrinsic to the agent. 
The random variables S and A are not to be mistaken 
with the sensors and actuators. The variable S is the 
output of the sensors, which is available to the controller 
or brain, the action A is the input that the actuators 
take. Consider an artificial robotic system as an exam¬ 
ple. Then the sensor state S could be the pixel matrix 
delivered by some camera sensor and the action A could 
be a numerical value that is taken by some motor con¬ 
troller to be converted in currents to drive a motor. 

Throughout this work, we use capital letter (A, Y, 
...) to denote random variables, non-capital letter (x, 
y, ...) to denote a specific value that a random variable 
can take, and calligraphic letters (A, 3^, ...) to denote 
the alphabet for the random variables. This means that 
Xt is the specific value that the random variable X can 
take a time f G N, and it is from the set Xt G X. Greek 
letters refer to generative kernels, i.e. kernels which de¬ 
scribe an actual underlying mechanism or a causal rela¬ 
tion between two random variables. 

We abbreviate the random variables for better compre¬ 
hension in the remainder of this work, as the information 
decomposition (see next sections) considers random vari¬ 
ables of consecutive time indices. Therefore, we use the 
following notation. Random variables without any time 
index refer to time t and hyphened variables to time f-|-1. 
The two variables W, W refer to Wt and Wt+i- 

Formally, the sensorimotor loop is given by the prob¬ 
ability distribution p{w) and the kernels a{w'\w,a), 
/3(s|w), and 7r(a|a). To analyse the quality of our de¬ 
rived quantification, it is best to evaluate them in a fully 
controllable setting. For this purpose, we chose the same 
parameterisable binary model of the sensorimotor loop 


that was used in our previous publication (Zahedi and 
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Figure 1: A formal model of the sensorimotor loop. 


Ay, 2013). It allows to control the causal dependencies 
of S, A, and W individually, and thereby, enables an 
evaluation of the information decomposition in the sen¬ 
sorimotor loop and compare it with our previous results. 
The model is shown in Figure and given by the follow¬ 
ing set of equations: 


o: 4 ,,,p,u:iw'\w, a) = 


/3<:(sH = 


^<pw'w-l-'ipw'a-l-cow'wa 

E acbw" 'W-\-'ipw" a-\-w" wa 
w"GQ, ^ 

Csw 




x(a|s) = 


Pr{w) = 


e™ 


( 1 ) 

( 2 ) 

(3) 

(4) 


where a,w,s,w' G = {±1} and 0, ■i/', w, /i, r > 0. As 


in (Zahedi and Ay 2013), the following two assumptions 


are made without loss of generality. First, it is assumed 
that all world states w G occur with equal probability, 
i.e. p(w = 1) = p(w = —1) = 1 / 2 . Second, we assume a 
deterministic sensor, i.e. C ^ 1 = ^swi which 

means that the sensor is a copy of the world state. The 
first assumption does not violate the generality, because 
it only assures that the world state itself does not already 
encode some structure, which is propagated through the 
sensorimotor loop. The second assumption does not vi¬ 
olate the generality of the model, because in a reactive 
system as in Figure]^ the sensor state S and A can be 
reduced to a common state, with a new generative ker¬ 
nel 7 (a|r(;) = 7r(a|s) o I3{s\w). Hence, keeping one of the 
two kernels deterministic and varying the other in the 
experiments below, does not reduce the validity of this 
model. This leaves four open parameters '(/>,(/), w, and /i, 
against which the morphological computation measure is 
validated. 


3 Information Decomposition 

Next, we introduce the information decomposition that 
underlies our measure of morphological computation. We 
first explain this information decomposition in a general 
information theoretic setting and later explain how we 
use it in the sensorimotor loop. 

Consider three random variables X, T, Z. Suppose 
that a system wants to predict the value of the ran¬ 
dom variable X, but it can only access the informa¬ 
tion in Y or Z. How is the information that Y and Z 
carry about X distributed over Y and Z? In general, 
there may be redundant or shared information (infor¬ 
mation contained both Y and Z), but there may also 
be unique information (information contained in only 
one of Y or Z). Finally, there is also the possibility 
of synergystic or complementary information, i.e. infor¬ 
mation that is only available when Y and Z are taken 
together. The classical example for synergy is the XOR 
function: If Y and Z are binary random variables and 
if A = YXORZ, then neither Y nor Z contain any in¬ 
formation about X (in fact, X is independent of Y and 
X is independent of Z), but when Y and Z are taken 
together, they completely determine X (in particular, X 
is not independent from the pair (A, F)). 

The total information that (A, F) contains about A 
can be quantified by the mutual information /(A : 
(F, F)). However, there is no canonical way to sepa¬ 
rate these different kinds of informations. Mathemati¬ 
cally, one would like to have four functions SI{X : Y-, Z) 
(“shared information”), UI{X : Y\Z) (“unique informa¬ 
tion of F”), UI{X : Z\Y) (“unique information of F”), 
CI{X :Y;Z) (“complementary information”) that sat¬ 
isfy 


/(A : (F, F)) = SI{X :Y-,Z) + UI{X :Y\Z) 

+ UI(X : Z\Y)yCI{X :Y-Z). (5) 


From the interpretation it is also natural to require 

MI{X : F) = SI{X ■.Y]Z) + UI{X : F \ F), 
MI{X : F) = SI{X :Y;Z) + UI{X : F \ F). 


( 6 ) 


A set of three functions S'/, C//, and Cl that satisfy ([^ 
and jUL is called a bivariate information decomposition 
by Bertschinger et al. (2014). It follows from the defining 


equations and the chain rule of mutual information that 
an information decomposition always satisfies 

MI{X : F|F) = UI{X ■.Y\Z) + CI{X : F; F). (7) 

Equations ([^ and ([^ do not specify the functions SI, 
UI, and Cl. Several different candidates have been pro¬ 


posed so far, for example by Williams and Beer (2010) 
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). We will use the decomposition sorimotor loop, we also expect to find aspects of redun- 

of I 

Bertschinger et al. 

(2014 

) that is defined as follows: dant, unique, and complementary influence of IF and A 


Let A be the set of all possible joint distributions of 
A, y, and Z. Fix an element P S A (the “true” joint 
distribution of A, A, and Z). Define 

Ap = G a : 

Q(X = x,Y = y)=PiX = x,Y = y) 
and Q(A = x, Z = z) = P(A = x, Z = z) 

for all X G A, y G 3^, z G z| 

as the set of all joint distributions which have the same 
marginal distributions on the pairs (A, A) and (A, Z). 
Then 

P/(A : A \ Z) = min /q(A : A|Z), 

QG Ap 

SI{X :Y;Z)= max CoIq{X-,Y-, Z), 

QG Ap 

CI{X : A; Z) = /(A : (A, Z)) - min /q(A : (A, Z)), 

Ap 

where Co/ denotes the co-information. Here, a subscript 
Q in an information quantity means that the quantity is 
computed with respect to Q as the joint distribution. 

One idea behind these functions is the following: Sup¬ 
pose that the joint distribution P of A, A, and Z is not 
known, but that just the marginal distributions of the 
pairs (A, A) and (A, Z) are known. This information 
is sufficient to characterize the set Ap, but we do not 
know which element of Ap is the true joint distribution. 
One can argue that the UI and SI should be constant 
on Ap; that is, shared information and unique informa¬ 
tion should depend only on the interaction of A and A 
and the interaction on A and Z, but not on the way in 
which the three variables interact. 

The second property that characterizes the informa¬ 
tion decomposition is that the set Ap contains a distri¬ 
bution Q such that CIq{X : A; Z) = 0. In other words, 
when only the marginal distributions of the pairs (A, A) 
and (A, Z) are known, then we cannot know whether 


there is synergy or not. See (Bertschinger et ah, 20141 


for a more detailed justification and a proof how these 
properties, determine the functions t//, S'/, and Cl. 


In Bertschinger et al. (2014), the formulas for UI, Cl, 


and SI are derived from considerations about decision 
problems in which the objective is to predict the out¬ 
come of A. Here, we want to apply the information 
decomposition in another setting: We will set A = W’, 
A = W, and Z = A. In our setting, W and A not 
only have information about W', but they actually con¬ 
trol W. However, the situation is similar: In the sen- 


on W'. Formally, since everything is defined probabilisti¬ 
cally, we can still use the same functions UI, Cl, and SI. 
We believe that the arguments behind the definition of 
UI, Cl and SI remain valid in the setting of the senso¬ 
rimotor loop where we need it. First, it is still plausible 
that unique and redundant contributions should only de¬ 
pend on the marginal distributions of the pairs (IF, W) 
and (A, IF'). Second, in order to decide whether IF and 
A act synergistically, it does not suffice to know only 
these marginal distributions. Therefore, we believe that 
the functions UI, Cl, and SI have a meaningful inter¬ 
pretation. In particular, we hope to be able to use the 
information decomposition in order to measure morpho¬ 
logical computation. This view is supported by our sim¬ 
ulations below, which indicate that the functions UI, 
Cl and SI do indeed lead to a reasonable decomposi¬ 
tion of I{W' : (H, IF)) and that the unique information 
UI{W : W'\A) is a reasonable measure of morphological 
computation, at least in our simple model of the senso¬ 
rimotor loop. 

The parameters of our model of the sensorimotor loop 
(Eqs § to @) can also be interpreted in terms of an 
information decomposition. Intuitively, (/) corresponds 
to the unique influence of IF on IF', -0 corresponds to 
the unique influence of A on IF', and w corresponds 
to the complementary influence. However, the role of 
the additional parameters C, /i, r is not so clear, and 
it is not so easy to find a correspondence of redun¬ 
dant information. The information decomposition has 
the advantage, that its definition does not depend on a 
parametrization. Observe that if the “synergistic param¬ 
eter” w = 0 vanishes, then it does not necessarily follow 
that CI{W' : A; IF) = 0 (see Fig.[^. However, we do ex¬ 
pect the complementary information to be small in this 
case. 

4 Morphological computation 

Morphological computation was described as the contri¬ 
bution of the embodiment to a behaviour. In our previ¬ 
ous work, we derived two concepts to quantify morpho¬ 
logical computation, which are both based on the world 
dynamics kernel a{w'\w,a). 

The first concept assumes that the current action 
A has no influence on the next world state IF', in 
which case the kernel a{w'\w,a) reduces to a{w'\w). 
If this is the case, we would say that the sys¬ 
tems shows maximal morphological computation, as 
the behaviour is completely determined by the world. 
To measure the amount of morphological computa- 
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tion present in a recorded behaviour, we calculated 
how much the data differed from the assumption by 
calculating the weighted Kullback-Leibler divergence 
aPi'^^ cL)DKLio'{w'\w, o)||jw)), which is the con¬ 
ditional mutual information I{W' : Because this 

quantity is zero if we have maximal morphological com¬ 
putation, we inverted and normalised in the following 

way: 1 — I{W' : A\W) / \og 2 \W\- 

The second concept started with the complementary 
assumption that the current world state W had no influ¬ 
ence on the next world state Vb', i.e., that the world dy¬ 
namics kernel is given by a{w'\a). Morphological compu¬ 
tation was then quantified as the error from the assump¬ 
tion, given by the weighted Kullback-Leibler divergence 
aPi'^^ cL)DKLio'{w'\w, o)||Q!(?w'|fl)), which equals the 
conditional mutual information I{W' : 1T|A). 

Both concepts were analysed and quantifications were 
derived, which didn’t require knowledge about the world, 
but could be calculated from intrinsically available in¬ 
formation only. At that time, we could not determine 
which of the two concepts would capture morphological 
computation best, although both concepts and their in¬ 
trinsic adaptations lead to different results in a specific 
configuration {-0 = (/> « 0). 

Our intention in this publication is to answer this ques¬ 
tion. For this purpose, we follow a different approach 
to quantify morphological computation, by starting with 
the mutual information of I(W' : {W, A)) and decompose 
it into the shared, unique and synergistic information, as 
described in the previous section. Rewriting the Equa¬ 
tion ([^, by replacing X, Y, Z by W, W, A, we obtain the 
following information decomposition: 

I(W' : (W, A)) = SI{W' ■.W;A) + UI{W' :W\A) 

+ UI{W' : A \ VF) -f SYiW' : W; A) 

( 8 ) 

As show in Equation Q, our previous concept two, the 
conditional mutual information I{W' : VF|A) is given by 
the sum of the unique information UI{W' : IE \ A) and 
the synergistic information CI{W' : IE; A): 

I{W' : 1E| A) = UliW' : fE \ A) + CI{W' : IE; A). (9) 

The examples we have discussed in the introduction (in¬ 
sect wing and Passive Dynamic Walker) suggest to use 
the unique information UI{W' : 1E\ A) to quantify mor¬ 
phological computation, because it captures the infor¬ 
mation that the current and next world state IE, IE' 
share uniquely. The next section presents numerical 
simulations to investigate how the conditional mutual 
information I{W' : 1E|A) and the unique information 
UI{W’ : IE \ W) compare with respect to quantifying 
morphological computation. 


5 Experiments 


The experiments in this section are conducted on the 
parameterised model of the sensorimotor loop that was 
introduced in the second section (see Eig. and Eq. 0 
to Eq. 0). As stated earlier, we set r = 0, which means 
that the world state IE is drawn with equal probability 
{p{w = —1) = p{w = 1 ) = 1/2), and C such that the sen¬ 
sor state S' is a copy of the world state IE. This leaves 
four parameters for variation, namely the three world 
dynamics kernel parameters (j), ip, lo and the policy pa¬ 
rameter ji. We decided to plot the information theoretic 
quantities only for ^ = 0 (see Fig. and Fig. |^, i.e., for 
the case, in which the action A is chosen independently 
of the current sensor value S and with equal probabil¬ 
ity. This allows us to investigate the effect of the action 
A on the next world state IE', without any influence of 
IE on A. We also know from previous experiments (see 
Zahedi and Ay [2013 ), that the conditional mutual in¬ 
formation I{W' : 1E|A) drops to zero for increasing /i. 
As the conditional mutual information is the sum of the 
unique and synergistic information, we know that both 
quantities will also decrease with increasing p. If A is de¬ 
terministically dependent on IE, it also follows that the 
unique information UI{W' : A\ IE) is zero, because A 
and IE are interchangeable. The only quantity that will 
be larger than zero is the shared information, which, by 
definition, is not of interest in the context of this work. 

We decided to plot the information decomposition for 
varying (p (parameter of unique influence of IE on IE') 
and Ip (parameter of unique influence of A on IE') for two 
different values of uj (parameter of synergistic influence 
of W, A on IE', see Eq. Q). Figureshows the results 
for u) = 0, while Figure shows the results for oj = 2. 
We will first discuss the results for uj = 0.0, as they are 


best comparable with our previous results from (Zahedi 


and Ay 2013). 


Vanishing synergistic parameter (w = 0): Fig¬ 
ure shows that synergistic information CI{W' : IE; A) 
is small and only present if ip cp (diagonal of the im¬ 
age). This is in agreement with our intuition that uj is 
the synergistic parameter. The unique information of 
the action A and the next world state IE', denoted by 
UI(W' : A \ IE), is shown in Figure [^. The plot re¬ 
veals that the unique information of the current action 
A and the next world state IE' is only present, when¬ 
ever Ip > p, and it is large, whenever ip is significantly 
larger than p. Figure shows analogous results for the 
unique information UI{W' : IE \ A). In this case, the 
unique information is negligible, whenever p p and 
grows whenever p is significantly larger then p. These 
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Figure 2: Information decomposition for ^ = 0.0, w = 0.0 Figure 3: Information decomposition for fj. = 0.0, w = 2.0 


two plots show that the definition of the unique informa¬ 
tion, as proposed by Bertschinger et al. (20141, is able 
to extract the unique influence in a setting in which two 
random variables actually control a third random vari¬ 
able. Fig. HP shows the conditional mutual information 
I{W' : IF|.4), which was the second concept of quan¬ 
tifying morphological computation in our previous work 
( Zahedi and Ay[ 20131. As stated earlier, the conditional 
mutual information is given by the sum of the unique 
and synergistic information (Eq. ©)• Hence, there is 
almost no difference between Figure and Figure [^, 
except on the diagonal, where the unique information 
UI{W' : IF \ A) drops faster to zero. 


Positive synergistic parameter (w = 2): To study 
the difference between the unique information UI{W' : 
IF \ A) and the conditional mutual information I{W' : 
IF I A), and hence, compare the new quantification with 
our former concept, we conducted the same experiments 
with a value of w = 2 (see Figures and i- Fig- 
ures HP -C demonstrate how the information decompo¬ 
sition can distinguish between the synergistic informa¬ 
tion and the unique informations, which is exactly what 
we need to quantify morphological computation. The 
unique information UI{W' : IF \ A) captures only the 
information that the current world state IF and the next 
world state IF' share, and therefore, captures the com¬ 
mon understanding of morphological computation in the 
context of embodied artificial intelligence. In the in- 


I(W':W|A) U(W';W\A) 



Figure 4: Difference between I{W' : IF|A) and UI{W’ : 
IF \ A) for w = 2. 


troduction, we presented two examples of morpholog¬ 
ical computation, which described it as the contribu¬ 
tion of the body and environment to a behaviour, that 
cannot be assigned to any neural system or robot con¬ 
troller. The unique information UI(W' : IF \ A) Fig¬ 
ure captures this notion of morphological computa¬ 
tion best, because it vanishes if the synergistic informa¬ 
tion C/(IF' : IF;A) (see Fig. [^) or the unique infor¬ 
mation [//(IF' : A\ IF) (see Fig. [3p) increases. Given 
Eq. ([^, it is clear that the conditional mutual informa¬ 
tion /(IF' : IF|A) is positive (see Fig.|3p) whenever the 
unique information [//(IF' : A \ IF) or the synergistic 
information C/(IF' : IF; A) is positive. This is problem¬ 
atic for the following reason. Figure |3p) show a positive 
conditional mutual information I{W' : IF|A) also for val¬ 
ues oi 'ijj > (j), which is counter-intuitive. Furthermore, 
as Figure]^ shows (note that the (pip axes are rotated for 
better visibility), the conditional mutual information is 
indifferent for a large range oi {(j) — 'ip\ < d. Additionally, 
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the conditional mutual information increases for vanish¬ 
ing (j) and tp, which again is counter-intuitive, whereas 
the unique information UI{W' : fF \ A) (see right-hand 
side of Fig. nicely reflects our intuition. Therefore, we 
conclude that the unique information UI{W' : IT \ A) 
is best suited to quantify morphological computation in 
the context of embodied artificial intelligence. 

6 Discussion 

This work proposes a quantification of morphological 
computation based on an information decomposition in 
the sensorimotor loop. In the introduction, morphologi¬ 
cal computation was described as the contribution of an 
agent’s body and agent’s Umwelt to its behaviour. Im¬ 
portant to note is that both mentioned examples high¬ 
lighted the contribution of the embodiment that resulted 
solely from interactions of the body and environment and 
that cannot be attributed to any type of control by the 
agent. This is why we propose to use a decomposition 
of the mutual information I{W' : {W,A)) into shared, 
unique and synergistic information. This allows us to 
separate contributions of the embodiment from contri¬ 
butions of the controller (via its actions A) and contri¬ 
butions of both, controller and embodiment. 

We showed that the information decomposition is re¬ 
lated to our previous work in the following way. The 
sum of the unique information UI{W' : IT \ ^) and the 
synergistic information CI{W' : IF; A) is equivalent the 
conditional mutual information I{W' : IF|>1), which is 
one of the two earlier concepts for morphological com¬ 
putation. This relation shows the difference of this work 
compared to our former results. We are now able to 
quantify exactly how much of the next world state IF' 
is determined by the current world state IF, thereby ex¬ 
cluding any influence of the action A. Therefore, we 
proposed UI{W' : IF \ A) as a quantification of morpho¬ 
logical computation. 

In two numerical simulations, we evaluated the decom¬ 
position in a parametrised, binary model of the sensori¬ 
motor loop. The world dynamics kernel a{w'\w,a) was 
parametrised with three parameters, (p, tp, and w, which 
roughly relate to the unique information UI{W' : IF\A), 
the unique information UI{W' : A \ IF), and the syner¬ 
gistic information C/(IF' : IF;^). For a fixed value of 
w, the two parameters (p and p were varied to evaluate 
the information decomposition in the sensorimotor loop. 
It was shown that for a vanishing synergistic parame¬ 
ter w = 0, synergistic information was present only for 
p Ip. This explains why there is only a marginal differ¬ 
ence between UI{W' : IF \ A) and J(IF' : W\A) in this 
setting. For a positive synergistic parameter w = 2, we 


saw that the synergistic information was positive for a 
much larger domain, which led to a significant difference 
between UI(W' : IF \ >1) and J(IF' : W\A). In particu¬ 
lar, the condition mutual information J(IF' : IF|^) was 
positive for a larger range of parameter values p and p. 
There is a domain \p — p\ < d, for which the conditional 
mutual information I{W' : IF|A) is positive and indif¬ 
ferent. One would expect to see a higher morphological 
computation mostly when p > despite the fact that 
synergistic information is present. This shows that the 
[//(IF' :W\A) is better suited to quantify morpholog¬ 
ical computation. 

In Zahedi and Ay (2013) it was proposed that a mea¬ 
sure of morphological computation could be used as a 
guiding principle in an open-ended self-organised learn¬ 
ing setting. For this to work, this measure should 
only depend on information that is intrinsically avail¬ 
able to the system. Clearly, this is not the case for 
[//(IF' : IF \ A). Therefore, future work will include 
derivations of the information decomposition, which only 
include intrinsically available information. It would also 
be interesting to investigate how much a formalisation of 
the information decomposition can benefit from a con¬ 
sideration of the causal information flow. The starting 
point for our decomposition was the mutual information 
/(IF' : (IF', A)), which is a correlational measure and 
not a measure of causal dependence, as e.g. proposed by 
Pearl (2000). In currently ongoing work, we are apply¬ 


ing the quantihcation to motion capturing data of real 
robots. 
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A Appendix: Computing UI, SI, 
and Cl. 


In this appendix we shortly explain how we computed 
the functions UL and CL. The appendix of |Bertschinger| 


et al. (20141 explains how to parametrize the set Ap and 


how to solve the optimization problems in the defini¬ 
tions of UI, CL, and SI. In our case, where all variables 
are binary, Ap consists of all probability distributions 
(57 _i,7+i with 


w' 

w 

a 

,w,a) 

-1 

-1 

-1 

P{w', w, a) + 7-1 

-1 
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P{w',w,a) - 7-1 

-1 

+1 

-1 
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+1 
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+1 

-1 
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-1 
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+1 

+1 

-1 

P{w',w,a) - 7+1 

+1 

+1 

+1 

P{w',w,a) + 7+1 


The range of the two parameters 7 ±i is restricted in such 
a way that (57i_,7^i has no negative entries. Since every 
entry (w', w, a) involves only one of the two pa¬ 

rameters, Ap is a rectangle, bounded by the inequalities 
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von Uexkuell, J. (1957 (1934)). A stroll through the 
worlds of animals and men. In Schiller, C. H., editor, 
Instinctive Behavior, pages 5-80. International Univer¬ 
sities Press, New York. 


max{-P(-l,-l,-l),-P(-l,-Hl,-kl)} < 7 - 1 , 
min{P(-l, -1, -hi), P(-l, +1,-1)} > 7-1, 
max{-P(-hl, -1, -1), -P(+l, +1, +1)1 < 7 + 1 , 
min{P(-|-l, —1,+1),P(+1,+1,—1)1 > 7 + 1 . 

To approximately solve the optimization problem we 
computed the values on a grid and took the optimal 
value. This simple procedure yields an approximation 
that is good enough for our purposes. 









